Fast Query Processing Using Cooperative CPU Caching for Index Structures
Authors
Abstract
Index data structures are an important component of database applications. Our measurements show that for a memory-resident CSB+-tree (Cache Sensitive B+ tree), the cache miss penalty now accounts for over 50% of the total running time. This is exacerbated by current trends. As a result of Moore’s law, both the CPU speed and the DRAM density have doubled every 18 months. However, the cache miss penalty for the CPU has increased with today’s longer cache lines. Clusters provide a cost-effective alternative using commodity equipment. We explore a cache-conscious index solution for a cluster. We propose a novel approach for fast query processing that aggregates caches over multiple CPUs. We call this cooperative CPU caching. The idea is to decompose the memory-resident sorted array into partitions so that both a partition and an index over that partition can fit into the L2 cache of one CPU. We then process queries in parallel, trading cache misses on a single computer for network communication. We evaluate three alternative strategies for querying a cache-resident sorted array. We compare our method using a cluster of n CPUs with the alternative of n independent CPUs, each processing a separate stream of queries. On a standard cluster, our approach is shown to be approximately 50% faster. By managing the aggregate CPU caches, we achieve super-linear speedup: a cluster of 10 CPUs can be more than 10 times faster than a single CPU. In both experimental and theoretical evaluations, we show that our approach outperforms other approaches.
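The partitioning idea in the abstract can be illustrated with a minimal single-process sketch. It is not the authors' implementation: the function names (`partition_sorted`, `build_boundaries`, `lookup`) are hypothetical, network communication between nodes is not modeled, and keys are assumed to be distinct. Each partition stands in for the slice of the sorted array that one CPU would keep resident in its L2 cache, and a small boundary array routes each query to the partition that owns the key.

```python
from bisect import bisect_left, bisect_right


def partition_sorted(keys, num_nodes):
    """Split a sorted list into num_nodes contiguous, nearly equal partitions.

    Assumption: in a real deployment num_nodes would be chosen so that each
    partition (plus any index over it) fits in one CPU's L2 cache.
    """
    if num_nodes <= 0 or len(keys) < num_nodes:
        raise ValueError("need at least one key per partition")
    size, extra = divmod(len(keys), num_nodes)
    parts, start = [], 0
    for i in range(num_nodes):
        # The first `extra` partitions take one additional key.
        end = start + size + (1 if i < extra else 0)
        parts.append(keys[start:end])
        start = end
    return parts


def build_boundaries(parts):
    """Return the smallest key of every partition except the first."""
    return [p[0] for p in parts[1:]]


def lookup(parts, boundaries, key):
    """Route a key to its owning partition, then binary-search inside it.

    Returns (partition_index, local_index), or None if the key is absent.
    """
    owner = bisect_right(boundaries, key)  # which node would own this key
    part = parts[owner]
    i = bisect_left(part, key)             # search within that node's slice
    if i < len(part) and part[i] == key:
        return owner, i
    return None
```

In this sketch the routing step touches only the small boundary array, and the second search touches only one partition, which is the property that would let each node answer its share of queries from cache.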
Similar Resources
Ultra-Fast Image Reconstruction of Tomosynthesis Mammography Using GPU
Digital Breast Tomosynthesis (DBT) is a technology that creates three-dimensional (3D) images of breast tissue. Tomosynthesis mammography detects lesions that are not detectable with other imaging systems. If image reconstruction time is on the order of seconds, tomosynthesis systems can be used to perform tomosynthesis-guided interventional procedures. This research has been designed to study u...
Cross-Layer System for Cluster Based Data Access in MANET’S
The objective of cooperative caching is to improve data availability, improve access efficiency and reduce query delay in mobile Ad-Hoc networks. Many types of cache replacement algorithms like LRU, LFU, LFRU, LRU-MIN and LFU-MIN are used to improve data accessibility and reduce query delay in cluster based cooperative caching in Mobile Ad-Hoc networks. But they have some limitations such as ac...
Cooperative caching for grid-enabled OLAP
In this paper, we propose a grid-based On-Line Analytical Processing (OLAP) application which distributes query computation across an enterprise grid. Our application follows a two-tiered process for answering queries based on sharing Cached OLAP data between the users at the local grid site and using grid scheduling approaches to execute the remaining parts of a query amongst a distributed set...
Descriptive Name Services for Large Internets
This thesis addresses the challenge of locating people, resources, and other objects in the global Internet. As the Internet grows beyond a million hosts in tens of thousands of organizations, it is increasingly difficult to locate any particular object. Hierarchical name services are frustrating, because users must guess the unique names for objects or navigate the name space to find informati...
Caching Schema Information and Intermediate Results for Fast Incremental XML Query Processing in RDBSs
Many index structures and algorithms have been proposed for efficient query processing in XML databases. However, experience with view-based query answering in RDBSs shows that the incremental evaluation based on cached query results can substantially improve the performance compared to the evaluation from scratch. The main problems related to caching are: (1) to determine which cache entries c...
Journal title: CoRR
Volume: cs.DC/0410066
Issue: -
Pages: -
Publication date: 2004